Linear Regression from Strategic Data Sources
نویسندگان
چکیده
منابع مشابه
Classification with Strategic Data Sources
We consider a binary classification problem where the labels of training data are elicited from crowd workers who are strategic at exerting costly effort for coming up with a label or at reporting the resultant label. The goal of the learner is to learn an un-biased classifier that minimizes 0-1 loss. For a non-strategic setting, when the error rates of the two classes in the training data are ...
متن کاملData enriched linear regression
We present a linear regression method for predictions on a small data set making use of a second possibly biased data set that may be much larger. Our method fits linear regressions to the two data sets while penalizing the difference between predictions made by those two models. The resulting algorithm is a shrinkage method similar to those used in small area estimation. We find a Stein-type f...
متن کاملOptimum Statistical Estimation with Strategic Data Sources
We propose an optimum mechanism for providing monetary incentives to the data sources of a statistical estimator such as linear regression, so that high quality data is provided at low cost, in the sense that the weighted sum of payments and estimation error is minimized. The mechanism applies to a broad range of estimators, including linear and polynomial regression, kernel regression, and, un...
متن کاملFuzzy numbers from raw discrete data using linear regression
This paper focuses on modelling fuzzy numbers with meaningful membership functions. More precisely, it proposes a method to construct trapezoidal fuzzy number approximations from raw discrete data. In many applications, input information is numerical, and therefore, particular fuzzy sets, such as fuzzy numbers, hold great interest and relevance in managing data imprecision and vagueness. The pr...
متن کاملOnline Linear Regression of Sampling Data from Performance Event Counters
Sampling is a common way to collect execution information with performance event counters. However, the sampling data generated from performance event counters tend to be massive if sampling with high frequencies. Storing and processing a large amount of sampling data require much disk space and computing time. In this paper, we propose the online linear regression method to reduce the size of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Economics and Computation
سال: 2020
ISSN: 2167-8375,2167-8383
DOI: 10.1145/3391436